Xho I - 4851




Asu II - 6059 

Rsr II - 5893 
Bss HII - 5774 

Bal I - 5459 
Pst I - 5430

Cla I - 5239 



Bgl II - 967 
Ppu MI - 984

Dra III - 1036 
Kpn I - 1046
Bsi WI - 1063

Eco RI - 1385



Sal I - 2095 
Mlu I - 2143 
Nhe I - 2152

Apa I - 2167

Age I - 2256
Sac II - 2674
Bsp 1407I - 2843
Sma I - 2863

Bam HI - 3145



Eco 47III - 3471 



8540 BP MAP BY MITCHELL REFF 11/6/92



L=Leader, CMV=Cytomegalovirus promoter
BETA=Mouse beta globin major promoter
BGH=Bovine growth hormone polyadenylation
SV=SV40 early polyadenylation 
SV ORI=SV40 origin 



Noncutters=AscI, AvrII, BspMII, Bst1107I,
HindIII, I-PpoI, MamI, MunI, NruI, PacI, PmeI,
PmlI, RemI, SgrAI, SpoI, SrfI, Sse8387I, SwaI,
XbaI, XcmI
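The map positions and the noncutter list above can be spot-checked against the sequence listing with a plain substring search. The following is a minimal sketch, not the method used to draw the map: `find_sites` is a hypothetical helper, the recognition sequences for NotI (GCGGCCGC), AatII (GACGTC) and XbaI (TCTAGA) are the standard ones, and only the first 60 bases of the listing are searched, as a linear fragment. Map positions may refer to the cut position rather than the first base of the site, so small offsets are to be expected.

```python
def find_sites(sequence: str, site: str) -> list[int]:
    """Return 1-based start positions of every occurrence of `site`.

    Spaces in the listing are ignored. Only the given strand is searched,
    which is sufficient for palindromic recognition sequences.
    """
    seq = sequence.replace(" ", "").upper()
    site = site.upper()
    positions = []
    start = seq.find(site)
    while start != -1:
        positions.append(start + 1)          # convert to 1-based numbering
        start = seq.find(site, start + 1)    # allow overlapping matches
    return positions


# First 60 bases of the sequence listing (bases 1-60).
FIRST_60 = "GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG"

print("NotI :", find_sites(FIRST_60, "GCGGCCGC"))
print("AatII:", find_sites(FIRST_60, "GACGTC"))
print("XbaI :", find_sites(FIRST_60, "TCTAGA"))  # listed as a noncutter
```

Running the same search over the full 8540 bp sequence would need the complete, error-free listing and, for sites spanning the origin, a circular search.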
GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG



AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC




CATCACAGAT CTCTCACCAT GAGGGTCCCC GCTCAGCTCC TGGGGCTCCT GCTGCTCTGG

W °l io» 1108 

CTCCCAGGTG CACGATGTGA TGGTACCAAG GTGGAAATCA AACGTACGGT GGCTGCACCA

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 

Worsen* Wp»- oms+»^ 3>a.^\pp lo?-Afc-*s*op ccdoo 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 

AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 
TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG
TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC
GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT
GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 

BGH poly A = 231 bp

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT
GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT 
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(cont. ) 



I Ulcer ^5- ist* , 

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG
ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA
TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CGCTAGCACC

ollOO 1_ "iTArfi-T _ 5 _ 3, I iW iT^- 

AAGGGCCCAT CGGTCTTCCC CCTGGCACCC TCCTCCAAGA GCACCTCTGG GGGCACAGCG 

GCCCTGGGCT GCCTGGTCAA GGACTACTTC CCCGAACCGG TGACGGTGTC GTGGAACTCA 

GGCGCCCTGA CCAGCGGCGT GCACACCTTC CCGGCTGTCC TACAGTCCTC AGGACTCTAC 

TCCCTCAGCA GCGTGGTGAC CGTGCCCTCC AGCAGCTTGG GCACCCAGAC CTACATCTGC

<3<t> bp = 33>oAA- + sUAv, 

AACGTGAATC ACAAGCCCAG CAACACCAAG GTGGACAAGA AAGCAGAGCC CAAATCTTGT
GACAAAACTC ACACATGCCC ACCGTGCCCA GCACCTGAAC TCCTGGGGGG ACCGTCAGTC 
TTCCTCTTCC CCCCAAAACC CAAGGACACC CTCATGATCT CCCGGACCCC TGAGGTCACA 
TGCGTGGTGG TGGACGTGAG CCACGAAGAC CCTGAGGTCA AGTTCAACTG GTACGTGGAC 
GGCGTGGAGG TGCATAATGC CAAGACAAAG CCGCGGGAGG AGCAGTACAA CAGCACGTAC 
CGTGTGGTCA GCGTCCTCAC CGTCCTGCAC CAGGACTGGC TGAATGGCAA GGAGTACAAG 
TGCAAGGTCT CCAACAAAGC CCTCCCAGCC CCCATCGAGA AAACCATCTC CAAAGCCAAA 
GGGCAGCCCC GAGAACCACA GGTGTACACC CTGCCCCCAT CCCGGGATGA GCTGACCAAG 
AACCAGGTCA GCCTGACCTG CCTGGTCAAA GGCTTCTATC CCAGCGACAT CGCCGTGGAG 
TGGGAGAGCA ATGGGCAGCC GGAGAACAAC TACAAGACCA CGCCTCCCGT GCTGGACTCC 
GACGGCTCCT TCTTCCTCTA CAGCAAGCTC ACCGTGGACA AGAGCAGGTG GCAGCAGGGG 
AACGTCTTCT CATGCTCCGT GATGCATGAG GCTCTGCACA ACCACTACAC GCAGAAGAGC 



STOP 



CTCTCCCTGT CTCCGGGTAA ATGAGGATCC GTTAACGGTT ACCAACTACC TAGACTGGAT



5 * 



TCGTGACAAC ATGCGGCCGT GATATCTACG TATGATCAGC CTCGACTGTG CCTTCTAGTT
GCCAGCCATC TGTTGTTTGC CCCTCCCCCG TGCCTTCCTT GACCCTGGAA GGTGCCACTC
CCACTGTCCT TTCCTAATAA AATGAGGAAA TTGCATCGCA TTGTCTGAGT AGGTGTCATT
CTATTCTGGG GGGTGGGGTG GGGCAGGACA GCAAGGGGGA GGATTGGGAA GACAATAGCA
GGCATGCTGG GGATGCGGTG GGCTCTATGG AACCAGCTGG GGCTCGACAG CGCTGGATCT
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CCCGATCCCC 
3 Wo 



AGCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 3540 
i_ 

AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 3600 

Mouse. \b>ex~*- ^\.c.o\<* '/vw\oc~ ore /v-cre <~ — _5<o6^P 

AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 3660 

GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA 3720 

GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 3780 

GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 3840 

CTGACATAGT TGTGTTGGGA GCTTGGATAG CTTGGACAGC TCAGGGCTGC GATTTCGCGC 3900

CAAACTTGAC GGCAATCCTA GCGTGAAGGC TGGTAGGATT TTATCCCCGC TGCCATCATG 3960

GTTCGACCAT TGAACTGCAT CGTCGCCGTG TCCCAAAATA TGGGGATTGG CAAGAACGGA 4020

GACCTACCCT GGCCTCCGCT CAGGAACGAG TTCAAGTACT TCCAAAGAAT GACCACAACC 4080 

TCTTCAGTGG AAGGTAAACA GAATCTGGTG ATTATGGGTA GGAAAACCTG GTTCTCCATT 4140 

CCTGAGAAGA ATCGACCTTT AAAGGACAGA ATTAATATAG TTCTCAGTAG AGAACTCAAA 4200 

GAACCACCAC GAGGAGCTCA TTTTCTTGCC AAAAGTTTGG ATGATGCCTT AAGACTTATT 4260 

GAACAACCGG AATTGGCAAG TAAAGTAGAC ATGGTTTGGA TAGTCGGAGG CAGTTCTGTT 4320 

TACCAGGAAG CCATGAATCA ACCAGGCCAC CTTAGACTCT TTGTGACAAG GATCATGCAG 4380 

GAATTTGAAA GTGACACGTT TTTCCCAGAA ATTGATTTGG GGAAATATAA ACTTCTCCCA 4440 

GAATACCCAG GCGTCCTCTC TGAGGTCCAG GAGGAAAAAG GCATCAAGTA TAAGTTTGAA 4500
5Tt>P 0Hf=<* ^ail g. 

GTCTACGAGA AGAAAGACTA ACAGGAAGAT GCTTTCAAGT TCTCTGCTCC CCTCCTAAAG 4560

CTATGCATTT TTATAAGACC ATGGGACTTT TGCTGGCTTT AGATCAGCCT CGACTGTGCC 4620

TTCTAGTTGC CAGCCATCTG TTGTTTGCCC CTCCCCCGTG CCTTCCTTGA CCCTGGAAGG 4680

Bovine growth hormone polyadenylation region = 231 bp

TGCCACTCCC ACTGTCCTTT CCTAATAAAA TGAGGAAATT GCATCGCATT GTCTGAGTAG 4740 

GTGTCATTCT ATTCTGGGGG GTGGGGTGGG GCAGGACAGC AAGGGGGAGG ATTGGGAAGA 4800 



CAATAGCAGG CATGCTGGGG ATGCGGTGGG CTCTATGGAA CCAGCTGGGG CTCGAGCTAC 4860 

TkGCTTTGCT TCTCAATTTC TTATTTGCAT AATGAGAAAA AAAGGAAAAT TAATTTTAAC 4920 

ACCAATTCAG TAGTTGATTG AGCAAATGCG TTGCCAAAAA GGATGCTTTA GAGACAGTGT 4980

Mouse beta globin major promoter

TCTCTGCACA GATAAGGACA AACATTATTC AGAGGGAGTA CCCAGAGCTG AGACTCCTAA 5040

GCCAGTGAGT GGCACAGCAT TCTAGGGAGA AATATGCTTG TCATCACCGA AGCCTGATTC 5100 

CGTAGAGCCA CACCTTGGTA AGGGCCAATC TGCTCACACA GGATAGAGAG GGCAGGAGCC 5160 

AGGGCAGAGC ATATAAGGTG AGGTAGGATC AGTTGCTCCT CACATTTGCT TCTGACATAG 5220 

1 >*<Zec* li-= iiU> ' , iTA-ftT JSC 

TTGTGTTGGG AGCTTGGATC GATCGtogT^GGTTGAACAA GATGGATTGC ACGCAGGTTC 5280 
TCCGGCCGCT TGGGTGGAGA GGCTATTCGG CTATGACTGG GCACAACAGA CAATCGGCTG
CTCTGATGCC GCCGTGTTCC GGCTGTCAGC GCAGGGGCGC CCGGTTCTTT TTGTCAAGAC
CGACCTGTCC GGTGCCCTGA ATGAACTGCA GGACGAGGCA GCGCGGCTAT CGTGGCTGGC
CACGACGGGC GTTCCTTGCG CAGCTGTGCT CGACGTTGTC ACTGAAGCGG GAAGGGACTG
GCTGCTATTG GGCGAAGTGC CGGGGCAGGA TCTCCTGTCA TCTCACCTTG CTCCTGCCGA
GAAAGTATCC ATCATGGCTG ATGCAATGCG GCGGCTGCAT ACGCTTGATC CGGCTACCTG
CCCATTCGAC CACCAAGCGA AACATCGCAT CGAGCGAGCA CGTACTCGGA TGGAAGCCGG
TCTTGTCGAT CAGGATGATC TGGACGAAGA GCATCAGGGG CTCGCGCCAG CCGAACTGTT
CGCCAGGCTC AAGGCGCGCA TGCCCGACGG CGAGGATCTC GTCGTGACCC ATGGCGATGC
CTGCTTGCCG AATATCATGG TGGAAAATGG CCGCTTTTCT GGATTCATCG ACTGTGGCCG
GCTGGGTGTG GCGGACCGCT ATCAGGACAT AGCGTTGGCT ACCCGTGATA TTGCTGAAGA
GCTTGGCGGC GAATGGGCTG ACCGCTTCCT CGTGCTTTAC GGTATCGCCG CTCCCGATTC

STOP NEO

GCAGCGCATC GCCTTCTATC GCCTTCTTGA CGAGTTCTTC TGAGCGGGAC TCTGGGGTTC
GAAATGACCG ACCAAGCGAC GCCCAACCTG CCATCACGAG ATTTCGATTC CACCGCCGCC

V orxV^^AsWcck »0eo = i>3 bp

TTCTATGAAA GGTTGGGCTT CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG
CGCGGGGATC TCATGCTGGA GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT
GGTTACAAAT AAAGCAATAG CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT



TCTAGTTGTG GTTTGTCCAA ACTCATCAAT CTATCTTATC ATGTCTGGAT CGCGGCCGCG 
ATCCCGTCGA GAGCTTGGCG TAATCATGGT CATAGCTGTT TCCTGTGTGA AATTGTTATC
CGCTCACAAT TCCACACAAC ATACGAGCCG GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 
AATGAGTGAG CTAACTCACA TTAATTGCGT TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 
ACCTGTCGTG CCAGCTGCAT TAATGAATCG GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 
TTGGGCGCTC TTCCGCTTCC TCGCTCACTG ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 
GAGCGGTATC AGCTCACTCA AAGGCGGTAA TACGGTTATC CACAGAATCA GGGGATAACG 

CAGGAAAGAA CATGTGAGCA AAAGGCCAGC AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 

^ 6-?1<l- terete t\a\ ortot« Q-V rcpUc*A"io<^ 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC CTGACGAGCA TCACAAAAAT CGACGCTCAA

GTCAGAGGTG GCGAAACCCG ACAGGACTAT AAAGATACCA GGCGTTTCCC CCTGGAAGCT 
CCCTCGTGCG CTCTCCTGTT CCGACCCTGC CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
CTTCGGGAAG CGTGGCGCTT TCTCAATGCT CACGCTGTAG GTATCTCAGT TCGGTGTAGG 
TCGTTCGCTC CAAGCTGGGC TGTGTGCACG AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 
Figure 2 (cont.)



TATCCGGTAA CTATCGTCTT GAGTCCAACC CGGTAAGACA CGACTTATCG CCACTGGCAG 7140 

CAGCCACTGG TAACAGGATT AGCAGAGCGA GGTATGTAGG CGGTGCTACA GAGTTCTTGA 7200 

AGTGGTGGCC TAACTACGGC TACACTAGAA GGACAGTATT TGGTATCTGC GCTCTGCTGA 7260 

AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA GCTCTTGATC CGGCAAACAA ACCACCGCTG 7320 

GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC AGATTACGCG CAGAAAAAAA GGATCTCAAG 7380 

AAGATCCTTT GATCTTTTCT ACGGGGTCTG ACGCTCAGTG GAACGAAAAC TCACGTTAAG 7440 

GGATTTTGGT CATGAGATTA TCAAAAAGGA TCTTCACCTA GATCCTTTTA AATTAAAAAT 7500 

*J>S°r StoP 

GAAGTTTTAA ATCAATCTAA AGTATATATG AGTAAACTTG GTCTGACAGT TACCAATGCT 7560

Bet*. \o^3rt. v ~a-<J£ 

TAATCAGTGA GGCACCTATC TCAGCGATCT GTCTATTTCG TTCATCCATA GTTGCCTGAC 7620 



TCCCCGTCGT GTAGATAACT ACGATACGGG AGGGCTTACC ATCTGGCCCC AGTGCTGCAA 7680
TGATACCGCG AGACCCACGC TCACCGGCTC CAGATTTATC AGCAATAAAC CAGCCAGCCG 7740
GAAGGGCCGA GCGCAGAAGT GGTCCTGCAA CTTTATCCGC CTCCATCCAG TCTATTAATT 7800
GTTGCCGGGA AGCTAGAGTA AGTAGTTCGC CAGTTAATAG TTTGCGCAAC GTTGTTGCCA 7860
TTGCTACAGG CATCGTGGTG TCACGCTCGT CGTTTGGTAT GGCTTCATTC AGCTCCGGTT 7920
CCCAACGATC AAGGCGAGTT ACATGATCCC CCATGTTGTG CAAAAAAGCG GTTAGCTCCT 7980
TCGGTCCTCC GATCGTTGTC AGAAGTAAGT TGGCCGCAGT GTTATCACTC ATGGTTATGG 8040
CAGCACTGCA TAATTCTCTT ACTGTCATGC CATCCGTAAG ATGCTTTTCT GTGACTGGTG 8100
AGTACTCAAC CAAGTCATTC TGAGAATAGT GTATGCGGCG ACCGAGTTGC TCTTGCCCGG 8160
CGTCAATACG GGATAATACC GCGCCACATA GCAGAACTTT AAAAGTGCTC ATCATTGGAA 8220
AACGTTCTTC GGGGCGAAAA CTCTCAAGGA TCTTACCGCT GTTGAGATCC AGTTCGATGT 8280
AACCCACTCG TGCACCCAAC TGATCTTCAG CATCTTTTAC TTTCACCAGC GTTTCTGGGT 8340
GAGCAAAAAC AGGAAGGCAA AATGCCGCAA AAAAGGGAAT AAGGGCGACA CGGAAATGTT 8400
GAATACTCAT ACTCTTCCTT TTTCAATATT ATTGAAGCAT TTATCAGGGT TATTGTCTCA 8460
TGAGCGGATA CATATTTGAA TGTATTTAGA AAAATAAACA AATAGGGGTT CCGCGCACAT 8520
TTCCCCGAAA AGTGCCACCT






COPYRIGHT, 1992 by Mitchell E. Reff, La Jolla, CA






Figure 3






GACGTCGCGG CCGCTCTAGG CCTCCAAAAA AGCCTCCTCA CTACTTCTGG AATAGCTCAG 60 
. 6 It 

AGGCCGAGGC GGCCTCGGCC TCTGCATAAA TAAAAAAAAT TAGTCAGCCA TGCATGGGGC 120 

GGAGAATGGG CGGAACTGGG CGGAGTTAGG GGCGGGATGG GCGGAGTTAG GGGCGGGACT 180

ATGGTTGCTG ACTAATTGAG ATGCATGCTT TGCATACTTC TGCCTGCTGG GGAGCCTGGG 240 

GACTTTCCAC ACCTGGTTGC TGACTAATTG AGATGCATGC TTTGCATACT TCTGCCTGCT 300 

GGGGAGCCTG GGGACTTTCC ACACCCTAAC TGACACACAT TCCACAGAAT TAATTCCCCT 360

AGTTATTAAT AGTAATCAAT TACGGGGTCA TTAGTTCATA GCCCATATAT GGAGTTCCGC 420 

GTTACATAAC TTACGGTAAA TGGCCCGCCT GGCTGACCGC CCAACGACCC CCGCCCATTG 480 

ACGTCAATAA TGACGTATGT TCCCATAGTA ACGCCAATAG GGACTTTCCA TTGACGTCAA 540 

^ 567- hp 

TGGGTGGACT ATTTACGGTA AACTGCCCAC TTGGCAGTAC ATCAAGTGTA TCATATGCCA 600

AGTACGCCCC CTATTGACGT CAATGACGGT AAATGGCCCG CCTGGCATTA TGCCCAGTAC 660 

ATGACCTTAT GGGACTTTCC TACTTGGCAG TACATCTACG TATTAGTCAT CGCTATTACC 720 

ATGGTGATGC GGTTTTGGCA GTACATCAAT GGGCGTGGAT AGCGGTTTGA CTCACGGGGA 780 

TTTCCAAGTC TCCACCCCAT TGACGTCAAT GGGAGTTTGT TTTGGCACCA AAATCAACGG 840 

GACTTTCCAA AATGTCGTAA CAACTCCGCC CCATTGACGC AAATGGGCGG TAGGCGTGTA 900 

JUkar*s 

CGGTGGGAGG TCTATATAAG CAGAGCTGGG TACGTGAACC GTCAGATCGC CTGGAGACGC 960



r.ATr.Ar.AKAT P.TC-TCAC T|AT Cj GATTTTCAG GTGCAGATTA TCAGCTTCCT GCTAATCAGT 1020 

GCTTCAGTCA TAATGTCCAG AGGACAAATT GTTCTCTCCC AGTCTCCAGC AATCCTGTCT 1080

GCATCTCCAG GGGAGAAGGT CACAATGACT TGCAGGGCCA GCTCAAGTGT AAGTTACATC 1140 

CACTGGTTCC AGCAGAAGCC AGGATCCTCC CCCAAACCCT GGATTTATGC CACATCCAAC 1200 

CTGGCTTCTG GAGTCCCTGT TCGCTTCAGT GGCAGTGGGT CTGGGACTTC TTACTCTCTC 1260 

ACAATCAGCA GAGTGGAGGC TGAAGATGCT GCCACTTATT ACTGCCAGCA GTGGACTAGT 1320 

AACCCACCCA CGTTCGGAGG GGGGACCAAG CTGGAAATCA AACGTACGGT GGCTGCACCA 1380

TCTGTCTTCA TCTTCCCGCC ATCTGATGAG CAGTTGAAAT CTGGAACTGC CTCTGTTGTG 1440 

TGCCTGCTGA ATAACTTCTA TCCCAGAGAG GCCAAAGTAC AGTGGAAGGT GGATAACGCC 1500 

CTCCAATCGG GTAACTCCCA GGAGAGTGTC ACAGAGCAGG ACAGCAAGGA CAGCACCTAC 1560

AGCCTCAGCA GCACCCTGAC GCTGAGCAAA GCAGACTACG AGAAACACAA AGTCTACGCC 1620 

TGCGAAGTCA CCCATCAGGG CCTGAGCTCG CCCGTCACAA AGAGCTTCAA CAGGGGAGAG 1680 
TGTTGAATTC AGATCCGTTA ACGGTTACCA ACTACCTAGA CTGGATTCGT GACAACATGC
t*W J 7- 

GGCCGTGATA TCTACGTATG ATCAGCCTCG ACTGTGCCTT CTAGTTGCCA GCCATCTGTT

1^1 *- 

GTTTGCCCCT CCCCCGTGCC TTCCTTGACC CTGGAAGGTG CCACTCCCAC TGTCCTTTCC 

TAATAAAATG AGGAAATTGC ATCGCATTGT CTGAGTAGGT GTCATTCTAT TCTGGGGGGT 

GGGGTGGGGC AGGACAGCAA GGGGGAGGAT TGGGAAGACA ATAGCAGGCA TGCTGGGGAT

GCGGTGGGCT CTATGGAACC AGCTGGGGCT CGACAGCTAT GCCAAGTACG CCCCCTATTG

3.O0A 3 ao»?-|8 

ACGTCAATGA CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTATGGGACT 
TTCCTACTTG GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT 
GGCAGTACAT CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC
CCATTGACGT CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC 
GTAACAACTC CGCCCCATTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA 



TAAGCAGAGC TGGGTACGTC CTCACATTCA GTGATCAGCA CTGAACACAG ACCCGTCGAC
ATGGGTTGGA GCCTCATCTT GCTCTTCCTT GTCGCTGTTG CTACGCGTGT CCTGTCCCAG
GTACAACTGC AGCAGCCTGG GGCTGAGCTG GTGAAGCCTG GGGCCTCAGT GAAGATGTCC
TGCAAGGCTT CTGGCTACAC ATTTACCAGT TACAATATGC ACTGGGTAAA ACAGACACCT 
GGTCGGGGCC TGGAATGGAT TGGAGCTATT TATCCCGGAA ATGGTGATAC TTCCTACAAT 
CAGAAGTTCA AAGGCAAGGC CACATTGACT GCAGACAAAT CCTCCAGCAC AGCCTACATG 
CAGCTCAGCA GCCTGACATC TGAGGACTCT GCGGTCTATT ACTGTGCAAG ATCGACTTAC 
TACGGCGGTG ACTGGTACTT CAATGTCTGG GGCGCAGGGA CCACGGTCAC CGTCTCTGCA 
GCTAGCACCA AGGGCCCATC GGTCTTCCCC CTGGCACCCT CCTCCAAGAG CACCTCTGGG
GGCACAGCGG CCCTGGGCTG CCTGGTCAAG GACTACTTCC CCGAACCGGT GACGGTGTCG 



TGGAACTCAG GCGCCCTGAC CAGCGGCGTG CACACCTTCC CGGCTGTCCT ACAGTCCTCA

i>o A Ac ■+ sf«p 000L o ^ 
GGACTCTACT CCCTCAGCAG CGTGGTGACC GTGCCCTCCA GCAGCTTGGG CACCCAGACC 

TACATCTGCA ACGTGAATCA CAAGCCCAGC AACACCAAGG TGGACAAGAA AGCAGAGCCC 

AAATCTTGTG ACAAAACTCA CACATGCCCA CCGTGCCCAG CACCTGAACT CCTGGGGGGA 

CCGTCAGTCT TCCTCTTCCC CCCAAAACCC AAGGACACCC TCATGATCTC CCGGACCCCT 

GAGGTCACAT GCGTGGTGGT GGACGTGAGC CACGAAGACC CTGAGGTCAA GTTCAACTGG 

TACGTGGACG GCGTGGAGGT GCATAATGCC AAGACAAAGC CGCGGGAGGA GCAGTACAAC 

AGCACGTACC GTGTGGTCAG CGTCCTCACC GTCCTGCACC AGGACTGGCT GAATGGCAAG 

GAGTACAAGT GCAAGGTCTC CAACAAAGCC CTCCCAGCCC CCATCGAGAA AACCATCTCC 
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AAAGCCAAAG GGCAGCCCCG AGAACCACAG GTGTACACCC TGCCCCCATC CCGGGATGAG 
CTGACCAAGA ACCAGGTCAG CCTGACCTGC CTGGTCAAAG GCTTCTATCC CAGCGACATC 
GCCGTGGAGT GGGAGAGCAA TGGGCAGCCG GAGAACAACT ACAAGACCAC GCCTCCCGTG
CTGGACTCCG ACGGCTCCTT CTTCCTCTAC AGCAAGCTCA CCGTGGACAA GAGCAGGTGG 



CAGCAGGGGA ACGTCTTCTC ATGCTCCGTG ATGCATGAGG CTCTGCACAA CCACTACACG 

STOP

CAGAAGAGCC TCTCCCTGTC TCCGGGTAAA TGAGGATCCG TTAACGGTTA CCAACTACCT



AGACTGGATT CGTGACAACA TGCGGCCGTG ATATCTACGT ATGATCAGCC TCGACTGTGC 
CTTCTAGTTG CCAGCCATCT GTTGTTTGCC CCTCCCCCGT GCCTTCCTTG ACCCTGGAAG 
GTGCCACTCC CACTGTCCTT TCCTAATAAA ATGAGGAAAT TGCATCGCAT TGTCTGAGTA 
GGTGTCATTC TATTCTGGGG GGTGGGGTGG GGCAGGACAG CAAGGGGGAG GATTGGGAAG



ACAATAGCAG GCATGCTGGG GATGCGGTGG GCTCTATGGA ACCAGCTGGG GCTCGACAGC

GCTGGATCTC CCGATCCCCA GCTTTGCTTC TCAATTTCTT ATTTGCATAA TGAGAAAAAA 
AGGAAAATTA ATTTTAACAC CAATTCAGTA GTTGATTGAG CAAATGCGTT GCCAAAAAGG 
ATGCTTTAGA GACAGTGTTC TCTGCACAGA TAAGGACAAA CATTATTCAG AGGGAGTACC
CAGAGCTGAG ACTCCTAAGC CAGTGAGTGG CACAGCATTC TAGGGAGAAA TATGCTTGTC 
ATCACCGAAG CCTGATTCCG TAGAGCCACA CCTTGGTAAG GGCCAATCTG CTCACACAGG 



ATAGAGAGGG CAGGAGCCAG GGCAGAGCAT ATAAGGTGAG GTAGGATCAG TTGCTCCTCA 



CATTTGCTTC TGACATAGTT GTGTTGGGAG CTTGGATAGC TTGGACAGCT CAGGGCTGCG



6 4S-*j 

ATTTCGCGCC AAACTTGACG GCAATCCTAG CGTGAAGGCT GGTAGGATTT TATCCCCGCT 

GCCATCATGG TTCGACCATT GAACTGCATC GTCGCCGTGT CCCAAAATAT GGGGATTGGC

AAGAACGGAG ACCTACCCTG GCCTCCGCTC AGGAACGAGT TCAAGTACTT CCAAAGAATG 

ACCACAACCT CTTCAGTGGA AGGTAAACAG AATCTGGTGA TTATGGGTAG GAAAACCTGG 

P H"F ft- 3 5 £»+ i r> 13 7- cv.-».«o OLCt Jl -t- vh>p oodcv> 
TTCTCCATTC CTGAGAAGAA TCGACCTTTA AAGGACAGAA TTAATATAGT TCTCAGTAGA 

GAACTCAAAG AACCACCACG AGGAGCTCAT TTTCTTGCCA AAAGTTTGGA TGATGCCTTA 

AGACTTATTG AACAACCGGA ATTGGCAAGT AAAGTAGACA TGGTTTGGAT AGTCGGAGGC 

AGTTCTGTTT ACCAGGAAGC CATGAATCAA CCAGGCCACC TTAGACTCTT TGTGACAAGG 

ATCATGCAGG AATTTGAAAG TGACACGTTT TTCCCAGAAA TTGATTTGGG GAAATATAAA 

CTTCTCCCAG AATACCCAGG CGTCCTCTCT GAGGTCCAGG AGGAAAAAGG CATCAAGTAT 



STOP DHFR

AAGTTTGAAG TCTACGAGAA GAAAGACTAA CAGGAAGATG CTTTCAAGTT CTCTGCTCCC
GACTGTGCCT TCTAGTTGCC AGCCATCTGT TGTTTGCCCC TCCCCCGTGC CTTCCTTGAC

Bovine growth hormone polyadenylation = 231 bp
CCTGGAAGGT GCCACTCCCA CTGTCCTTTC CTAATAAAAT GAGGAAATTG CATCGCATTG

TCTGAGTAGG TGTCATTCTA TTCTGGGGGG TGGGGTGGGG CAGGACAGCA AGGGGGAGGA 

TTGGGAAGAC AATAGCAGGC ATGCTGGGGA TGCGGTGGGC TCTATGGAAC CAGCTGGGGC 

TCGAGCTACT UgCTTTGCTT CTCAATTTCT TATTTGCATA ATGAGAAAAA AAGGAAAATT 

AATTTTAACA CCAATTCAGT AGTTGATTGA GCAAATGCGT TGCCAAAAAG GATGCTTTAG 

AGACAGTGTT CTCTGCACAG ATAAGGACAA ACATTATTCA GAGGGAGTAC CCAGAGCTGA 

GACTCCTAAG CCAGTGAGTG GCACAGCATT CTAGGGAGAA ATATGCTTGT CATCACCGAA

GCCTGATTCC GTAGAGCCAC ACCTTGGTAA GGGCCAATCT GCTCACACAG GATAGAGAGG 

GCAGGAGCCA GGGCAGAGCA TATAAGGTGA GGTAGGATCA GTTGCTCCTC ACATTTGCTT 

CTGACATAGT TGTGTTGGGA GCTTGGATCG ATCCTC mfq GTTGAACAAG ATGGATTGCA 



CGCAGGTTCT CCGGCCGCTT GGGTGGAGAG GCTATTCGGC TATGACTGGG CACAACAGAC 

AATCGGCTGC TCTGATGCCG CCGTGTTCCG GCTGTCAGCG CAGGGGCGCC CGGTTCTTTT 

TGTCAAGACC GACCTGTCCG GTGCCCTGAA TGAACTGCAG GACGAGGCAG CGCGGCTATC

GTGGCTGGCC ACGACGGGCG TTCCTTGCGC AGCTGTGCTC GACGTTGTCA CTGAAGCGGG 

AAGGGACTGG CTGCTATTGG GCGAAGTGCC GGGGCAGGAT CTCCTGTCAT CTCACCTTGC 

TCCTGCCGAG AAAGTATCCA TCATGGCTGA TGCAATGCGG CGGCTGCATA CGCTTGATCC 

GGCTACCTGC CCATTCGACC ACCAAGCGAA ACATCGCATC GAGCGAGCAC GTACTCGGAT 

GGAAGCCGGT CTTGTCGATC AGGATGATCT GGACGAAGAG CATCAGGGGC TCGCGCCAGC 

CGAACTGTTC GCCAGGCTCA AGGCGCGCAT GCCCGACGGC GAGGATCTCG TCGTGACCCA 

TGGCGATGCC TGCTTGCCGA ATATCATGGT GGAAAATGGC CGCTTTTCTG GATTCATCGA 

CTGTGGCCGG CTGGGTGTGG CGGACCGCTA TCAGGACATA GCGTTGGCTA CCCGTGATAT 

TGCTGAAGAG CTTGGCGGCG AATGGGCTGA CCGCTTCCTC GTGCTTTACG GTATCGCCGC 

STOP NEO

TCCCGATTCG CAGCGCATCG CCTTCTATCG CCTTCTTGAC GAGTTCTTCT GAGCGGGACT

6 >■»-_[_ i 

CTGGGGTTCG AAATGACCGA CCAAGCGACG CCCAACCTGC CATCACGAGA TTTCGATTCC 

ACCGCCGCCT TCTATGAAAG GTTGGGCTTC GGAATCGTTT TCCGGGACGC CGGCTGGATG

ATCCTCCAGC GCGGGGATCT CATGCTGGAG TTCTTCGCCC ACCCCAACTT GTTTATTGCA

GCTTATAATG GTTACAAATA AAGCAATAGC ATCACAAATT TCACAAATAA AGCATTTTTT 

5VHO cArly poly »<A««v I osi"»ov» t"c^»M- 'B3t-f> \ 
TCACTGCATT CTAGTTGTGG TTTGTCCAAA CTCATCAATC TATCTTATCA TGTCTGGATC

GCGGCCGCGA TCCCGTCGAG AGCTTGGCGT AATCATGGTC ATAGCTGTTT CCTGTGTGAA

Pod'] 
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?6»bo <3.S4 + s+op coffin 
FIGURE 4



Leader 

-20 -15 -10 

Frame 1 Met Asp Phe Gln Val Gln Ile Ile Ser Phe Leu Leu Ile Ser Ala Ser Val
ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA GTC 
987 996 1005 1014 1023 



-5 -1 +1 FR1 10
Ile Met Ser Arg Gly Gln Ile Val Leu Ser Gln Ser Pro Ala Ile Leu Ser Ala Ser
ATA ATG TCC AGA GGA CAA ATT GTT CTC TCC CAG TCT CCA GCA ATC CTG TCT GCA TCT
1038 1047 1056 1065 1074 1083 



20 23
Pro Gly Glu Lys Val Thr Met Thr Cys
CCA GGG GAG AAG GTC ACA ATG ACT TGC 
1095 1104 1113 



24 CDR1 27/ 29 30 34 

Arg Ala Ser Ser Ser Val Ser Tyr Ile His

AGG GCC AGC TCA AGT GTA AGT TAC ATC CAC 
1122 1131 1140 



35 FR2 40 45 49 

Trp Phe Gln Gln Lys Pro Gly Ser Ser Pro Lys Pro Trp Ile Tyr

TGG TTC CAG CAG AAG CCA GGA TCC TCC CCC AAA CCC TGG ATT TAT 
1152 1161 1170 1179 1188 



50 CDR2 55 56
Ala Thr Ser Asn Leu Ala Ser
GCC ACA TCC AAC CTG GCT TCT
1197 1209



57 60 FR3 65 70 

Gly Val Pro Val Arg Phe Ser Gly Ser Gly Ser Gly Thr Ser Tyr Ser 

GGA GTC CCT GTT CGC TTC AGT GGC AGT GGG TCT GGG ACT TCT TAC TCT 
1218 1227 1236 1245 1254 



75 80 85 88 

Leu Thr Ile Ser Arg Val Glu Ala Glu Asp Ala Ala Thr Tyr Tyr Cys

CTC ACA ATC AGC AGA GTG GAG GCT GAA GAT GCT GCC ACT TAT TAC TGC 

1266 1275 1284 1293 1302 
CDR3 95 97
Gln Gln Trp Thr Ser Asn Pro Pro Thr
CAG CAG TGG ACT AGT AAC CCA CCC ACG
1323 1332



98 100 FR4 105 107 

Phe Gly Gly Gly Thr Lys Leu Glu Ile Lys
TTC GGA GGG GGG ACC AAG CTG GAA ATC AAA 
1341 1350 1359 
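The codon-to-residue pairing shown in Figure 4 can be reproduced by reading the nucleotide rows in frame 1 with the standard genetic code. The following is a minimal sketch under that assumption: `CODON_TABLE` and `translate` are hypothetical names introduced here for illustration, one-letter residue codes are used instead of the three-letter codes of the figure, and the two input strings are copied from the leader and CDR1 rows above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"   # TTT .. TGG
         "LLLLPPPPHHQQRRRR"   # CTT .. CGG
         "IIIMTTTTNNKKSSRR"   # ATT .. AGG
         "VVVVAAAADDEEGGGG")  # GTT .. GGG
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; spaces are ignored, '*' marks a stop."""
    seq = dna.replace(" ", "").upper()
    usable = len(seq) - len(seq) % 3          # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# Leader row (residues -22 to -6) and CDR1 row (residues 24 to 34) of Figure 4.
LEADER = "ATG GAT TTT CAG GTG CAG ATT ATC AGC TTC CTG CTA ATC AGT GCT TCA GTC"
CDR1 = "AGG GCC AGC TCA AGT GTA AGT TAC ATC CAC"

print(translate(LEADER))  # Met Asp Phe Gln Val Gln Ile Ile Ser Phe Leu Leu Ile Ser Ala Ser Val
print(translate(CDR1))    # Arg Ala Ser Ser Ser Val Ser Tyr Ile His
```

Comparing such output against the residue rows is a quick way to catch transcription errors in either the codon row or the amino acid row.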
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